art conversational model
r/MachineLearning - [P] DialogPT: State of the Art Conversational Model with Fine-Tuned GPT-2 (Microsoft Research)
I've managed to get the model running generation on my PC. One thing needed to point out is that the checkpoint can NOT be loaded exactly as the GPT-2 model checkpoint from Huggingface pytorch-transformer repository. You'll also need to manually define the config, e.g. The generation works just fine by a nucleus sampling approach, and once in a while an E-O-T will be given to indicate end of one post. Bot: they're having an open gym soon in June... dont think they'll be up there this time though Bot: So what's this gym called?